APPRIS 2017: principal isoforms for multiple gene sets
Abstract
The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the 'principal' isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein-coding genes, and APPRIS principal isoforms are the best predictors of these main protein isoforms. Here, we present updates to the database. New developments include the addition of three new species (chimpanzee, Drosophila melanogaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species, and refinements in the core methods that make up the annotation pipeline. In addition, APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants.
Similar resources
APPRIS WebServer and WebServices
This paper introduces the APPRIS WebServer (http://appris.bioinfo.cnio.es) and WebServices (http://apprisws.bioinfo.cnio.es). Both the web servers and the web services are based around the APPRIS Database, a database that presently houses annotations of splice isoforms for five different vertebrate genomes. The APPRIS WebServer and WebServices provide access to the computational methods impleme...
APPRIS: annotation of principal and alternative splice isoforms
Here, we present APPRIS (http://appris.bioinfo.cnio.es), a database that houses annotations of human splice isoforms. APPRIS has been designed to provide value to manual annotations of the human genome by adding reliable protein structural and functional data and information from cross-species conservation. The visual representation of the annotations provided by APPRIS for each gene allows ann...
DEIsoM: a hierarchical Bayesian model for identifying differentially expressed isoforms using biological replicates
Motivation: High-throughput mRNA sequencing (RNA-Seq) is a powerful tool for quantifying gene expression. Identification of transcript isoforms that are differentially expressed in different conditions, such as in patients and healthy subjects, can provide insights into the molecular basis of diseases. Current transcript quantification approaches, however, do not take advantage of the shared inf...
Multiple Isoforms of ANRIL in Melanoma Cells: Structural Complexity Suggests Variations in Processing
The long non-coding RNA ANRIL, antisense to the CDKN2B locus, is transcribed from a gene that encompasses multiple disease-associated polymorphisms. Despite the identification of multiple isoforms of ANRIL, expression of certain transcripts has been found to be tissue-specific and the characterisation of ANRIL transcripts remains incomplete. Several functions have been associated with ANRIL. In...
Expression Analysis of RNA-Binding Motif Gene on Y Chromosome (RBMY) Protein Isoforms in Testis Tissue and a Testicular Germ Cell Cancer-Derived Cell Line (NT2)
a key factor in spermatogenesis and disorders associated with this protein have been recognized to be related to male infertility. Although it was suggested that this protein could have different functions during germ cell development, no studies have been conducted to uncover the mechanism of this potential function yet. Here, we analyzed the expression pattern of RBMY protein isoforms in test...